goldilock zone
Deconstructing the Goldilocks Zone of Neural Network Initialization
Vysogorets, Artem, Dawid, Anna, Kempe, Julia
The second-order properties of the training loss have a massive impact on the optimization dynamics of deep learning models. Fort & Scherlis (2019) discovered that a high positive curvature and local convexity of the loss Hessian are associated with highly trainable initial points located in a region coined the "Goldilocks zone". Only a handful of subsequent studies touched upon this relationship, so it remains largely unexplained. In this paper, we present a rigorous and comprehensive analysis of the Goldilocks zone for homogeneous neural networks. In particular, we derive the fundamental condition resulting in non-zero positive curvature of the loss Hessian and argue that it is only incidentally related to the initialization norm, contrary to prior beliefs. Further, we relate high positive curvature to model confidence, low initial loss, and a previously unknown type of vanishing cross-entropy loss gradient. To understand the importance of positive curvature for trainability of deep networks, we optimize both fully-connected and convolutional architectures outside the Goldilocks zone and analyze the emergent behaviors. We find that strong model performance is not necessarily aligned with the Goldilocks zone, which questions the practical significance of this concept.
Cloud removal Using Atmosphere Model
Cloud removal is an essential task in remote sensing data analysis. As the image sensors are distant from the earth ground, it is likely that part of the area of interests is covered by cloud. Moreover, the atmosphere in between creates a constant haze layer upon the acquired images. To recover the ground image, we propose to use scattering model for temporal sequence of images of any scene in the framework of low rank and sparse models. We further develop its variant, which is much faster and yet more accurate. To measure the performance of different methods {\em objectively}, we develop a semi-realistic simulation method to produce cloud cover so that various methods can be quantitatively analysed, which enables detailed study of many aspects of cloud removal algorithms, including verifying the effectiveness of proposed models in comparison with the state-of-the-arts, including deep learning models, and addressing the long standing problem of the determination of regularisation parameters. The latter is companioned with theoretic analysis on the range of the sparsity regularisation parameter and verified numerically.
The Hunt for the 'Goldilocks Zone' in Tech and Data Policy - InformationWeek
There is some truth in the notion that the enterprise precedes consumer technology breakthroughs, but there are shared concerns about policy found in both communities. At the CES Unveiled preview event held in New York, the Consumer Technology Association gave its usual glimpse of gadgets expected to be in demand in the coming year however legislative matters colored the conversation. Gary Shapiro, president and CEO of the CTA, spoke about his concerns and observations of the current climate surrounding the entire technology sphere and the effect proposed policies might have. CES Unveiled is the preamble to the annual CES trade conference held in January in Las Vegas. While CES is primarily a showcase for new televisions, smart devices, cloud-based gaming, and tricked out automated cars, there is some crosspollination with elements of enterprise technology with planned discussions on innovation policy, privacy, and transformation across industries. For instance, the CTA openly stated at CES Unveiled that the enterprise will lead the way for 5G wireless connectivity, setting the stage before consumers get a taste.
Finding the Goldilocks Zone for Applied AI โ Zetta Venture Partners โ Medium
This article originally appeared in TechCrunch. While Elon Musk and Mark Zuckerberg debate the dangers of artificial general intelligence, startups applying AI to more narrowly defined problems such as accelerating the performance of sales teams and improving the operating efficiency of manufacturing lines are building billion-dollar businesses. Narrowly defining a problem, however, is only the first step to finding valuable business applications of AI. To find the right opportunity around which to build an AI business, startups must apply the "Goldilocks principle" in several different dimensions to find the sweet spot that is "just right" to begin -- not too far in one dimension, not too far in another. Here are some ways for aspiring startup founders to thread the needle with their AI strategy, based on what we've learned from working with thousands of AI startups.
The Goldilocks zone: Towards better understanding of neural network loss landscapes
Fort, Stanislav, Scherlis, Adam
We explore the loss landscape of fully-connected neural networks using random, low-dimensional hyperplanes and hyperspheres. Evaluating the Hessian, $H$, of the loss function on these hypersurfaces, we observe 1) an unusual excess of the number of positive eigenvalues of $H$, and 2) a large value of $\mathrm{Tr}(H) / |H|$ at a well defined range of configuration space radii, corresponding to a thick, hollow, spherical shell we refer to as the \textit{Goldilocks zone}. We observe this effect for fully-connected neural networks over a range of network widths and depths on MNIST and CIFAR-10 with the $\mathrm{ReLU}$ non-linearity. The effect is not observed for the $\tanh$ non-linearity. Using our observations, we demonstrate a close connection between the Goldilocks zone, measures of local convexity/prevalence of positive curvature, and the suitability of a network initialization. We show that the high and stable accuracy reached when optimizing on random, low-dimensional hypersurfaces is directly related to the overlap between the hypersurface and the Goldilocks zone. We note that common initialization techniques initialize neural networks in this particular region of unusually high convexity, and offer a geometric intuition for their success. We take steps towards an analytic description of the general features of the loss function geometry, exploring its anisotropy and strong radial dependence. We support our theoretical results with experiments. Furthermore, we demonstrate that initializing a neural network at a number of points and selecting for high measures of local convexity such as $\mathrm{Tr}(H) / |H|$, number of positive eigenvalues of $H$, or low initial loss, leads to statistically significantly faster training on MNIST. Based on our observations, we hypothesize that the Goldilocks zone contains a high density of suitable initialization configurations.
Dwarf Planets, Water Plumes, and Bouncy Castles in Orbit: All the Best Space Stuff From 2016
This year, we kicked things off by telling you how deadly and difficult space is to explore. It can kill you with radiation, giant flying space rocks, and regular old time. And those are just a few of its weapons. But while space is a pretty dangerous place, it's also incredibly inspirational. If you take science fiction as your model--which we often do--people are at their best when faced with a seemingly insurmountable challenge.